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We previously reported the detection of bat coronaviruses (bat CoVs 1A, IB, HKU7, HKU8 and 
bat-severe acute respiratory syndrome coronavirus) in Miniopterus spp. that cohabit a cave in 
Hong Kong. Here, we report the full genomic sequences of bat CoVs 1 A, 1 B and HKU8. Bat 
CoVs 1A and IB, which are commonly found in the Miniopterus , are phylogenetically closely 
related. Using species-specific RT-PCR assays, bat CoVs 1A and 1 B were confirmed to have 
distinct host specificities to Miniopterus magnater and Miniopterus pusillus, respectively. 
Interestingly, co-infections of bat CoVs 1 B and HKU8 in M. pusillus are detected in seven of 38 
virus-positive specimens collected from 2004 to 2006. These findings highlight that co-infections 
of some coronaviruses might be common events in nature. The biological basis for the host 
restriction of bat coronaviruses, however, is yet to be determined. 


Coronavirus (CoV) is a genus of viruses in the family 
Coronaviridae under the order Nidovirales. Based on 
antigenic and genetic analyses, CoVs are divided into three 
major groups. The viruses have positive-sense, single- 
stranded RNA genomes with sizes ranging from 27000 to 
32000 nt (Gorbalenya et al. , 2006). Their genomes are 
polycistronic and contain at least five major open reading 
frames (ORFs) with the characteristic gene order [5'- 
replicase (rep), spike (S), envelope (E), membrane (M) and 
nucleocapsid (N)-3']. The severe acute respiratory syn¬ 
drome (SARS) outbreak in 2003 highlighted the potential 
health threat of coronaviruses to humans (Peiris et al., 
2003). This prompted an intensive search for the precursor 
of SARS CoV in wildlife and the discovery of several novel 
CoVs in bats in China, including the viruses that are closely 
related to SARS CoV (Dong et al., 2007; Lau et al, 2005; 
Poon et al, 2005; Tang et al, 2006; Woo et al, 2006). 
Recently, the biological importance of bats for the ecology 
of coronaviruses was further reiterated by the discovery of 
novel bat CoVs (BtCoVs) in other continents (Dominguez 
et al, 2007; Muller et al, 2007). 

We previously reported the discovery of the first BtCoV 
(Poon et al, 2005). In addition, we also performed a 
longitudinal study on Miniopterus magnater and 
Miniopterus pusillus bats that cohabit an abandoned cave 

The GenBank/EMBL/DDBJ accession numbers for the sequences 
reported in this paper are EU420137-EU420139. 

Supplementary material is available with the online version of this paper. 


in Hong Kong (Chu et al, 2006). BtCoV HKU7 and bat- 
SARS-CoV were detected only once in our specimens, 
while BtCoVs 1A, IB and HKU8 were detected repeatedly 
in these bats. A clear host species restriction was observed 
with BtCoV 1A being found in M. magnater and BtCoV IB 
in M. pusillus even though these two CoVs are genetically 
very similar and the two bat species co-inhabit the same 
cave. Thus, these virus surveillance studies might allow 
unique opportunities to understand better the dynamics 
and prevalence of coronaviruses within a single geograph¬ 
ical location. To understand better the genetic relationships 
of the BtCoVs that are repeatedly found in the local 
Miniopterus spp, representative faecal samples positive for 
BtCoVs 1A, IB and HKU8 in our previous investigations 
were selected for this study. Genomic sequences of these 
three viruses were determined and analysed as described in 
Supplementary Material (available in JGV Online). 

The genomes of BtCoVs 1A, IB and HKU8, excluding the 
3' poly(A) tails, have sizes of 28309, 28460 and 28757 nt, 
respectively (GenBank accession nos EU420137- 
EU420139). The G + C content of 1A, IB and HKU8 is 
38.2, 38.6 and 41.8%, respectively. The genome organiza¬ 
tions of these CoVs are typical of group 1 CoVs (Table 1) 
and the amino acid sequence identities of the major viral 
proteins between these viruses are shown in Supplementary 
Table SI (available in JGV Online). The 3' UTRs and N 
genes of these viruses were reported previously and will not 
be discussed further (Chu et al, 2006). Interestingly, HKU8 
was found to contain an extra ORF at its 3' end. This 
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Table 1 . Predicted ORFs in the genomes of BtCoV 1A (strain AFCD62), BtCoV IB (strain AFCD307) and BtCoV PIKU8 (strain 
AFCD77) 


The nucleotide positions for nsp 1-16 are also shown in the table. NA, Not applicable 



BtCoV 1A 

BtCoV IB 

BtCoV HKU8 


Position 

Length (nt) 

Position 

Length (nt) 

Position 

Length (nt) 

ORFla 

272-13078 

12804 

273-13247 

12972 

270-12965 

12693 

ORFlb 

13048-21069 

8019 

13217-21238 

8019 

12935-20959 

8022 

nspl (ppla/pplab) 

272-859 

588 

273-860 

588 

270-323 

54 

nsp2 (ppla/pplab) 

860-2950 

2091 

861-2948 

2088 

324-1811 

1488 

nsp3 (ppla/pplab) 

2951-8290 

5340 

2949-8459 

5511 

1812-8174 

6363 

nsp4 (ppla/pplab) 

8291-9712 

1422 

8460-9881 

1422 

8175-9599 

1425 

nsp5 (ppla/pplab) 

9713-10618 

906 

9882-0787 

906 

9600-10505 

906 

nsp6 (ppla/pplab) 

10619-11455 

837 

10788-11624 

837 

10506-11342 

837 

nsp7 (ppla/pplab) 

11456-11704 

249 

11625-11873 

249 

11343-11591 

249 

nsp8 (ppla/pplab) 

11705-12286 

582 

11874-12455 

582 

11592-12173 

582 

nsp9 (ppla/pplab) 

12287-12616 

330 

12456-12785 

330 

12174-12503 

330 

nsplO (ppla/ 
pplab) 

12617-13024 

408 

12786-13193 

408 

12504-12911 

408 

nspll (ppla) 

13025-13078 

54 

13194-13247 

54 

12912-12965 

54 

nspl2 (pplab) 

13025-15804 

2781 

13194-15973 

2781 

12912-15691 

2781 

nspl3 (pplab) 

15805-17595 

1791 

15974-17764 

1791 

15692-17482 

1791 

nspl4 (pplab) 

17596-19149 

1554 

17765-19318 

1554 

17483-19039 

1557 

nspl5 (pplab) 

19150-20166 

1017 

19319-20335 

1017 

19040-20056 

1017 

nspl6 (pplab) 

20167-21066 

900 

20336-21235 

900 

20057-20956 

900 

S 

21071-25198 

4125 

21240-25397 

4155 

20956-25083 

4125 

ORF3 

25198-25857 

657 

25366-26025 

657 

25083-25751 

666 

E 

25851-26075 

222 

26019-26243 

222 

25835-25959 

222 

M 

26082-26840 

756 

26250-27017 

765 

25966-26721 

753 

N 

26861-28030 

1167 

27038-28189 

1149 

26770-28038 

1266 

ORF7 

NA 

- 

NA 

- 

27788-28534 

744 


predicted ORF (designated ORF 7) overlaps with the N 
gene and encodes a putative protein with 248 aa residues 
(Table 1). 

The size of the 5' UTR of 1A, IB and HKU8 is 271, 272 and 
269 nt, respectively. Three stem-loop structures that were 
proposed to be conserved in group 1 CoVs (Liu et al., 
2007) and a core sequence of leader transcription 
regulatory sequence (5'-CUAAAC-3') were identified in 
these 5' UTR sequences (Supplementary Fig. SI available in 
JGV Online). 

These ORFs 1 occupy approximately 70 % of the genomes. 
ORF1 consists of ORFla and ORFlb that encodes viral ppla 
and pplab, respectively. Putative features responsible for the 
— 1 ribosomal frame shifting, e.g. the ‘slippage sequence’ (5'- 
UUUAAAC-3'), were identified in the genomes (data not 
shown). Based on in silico analysis, the ppla and pplab 
proteins are predicted to be cleaved into a total of 16 non- 
structural proteins by virus proteases (Table 1) (Chen et al., 
2003). The predicted 3CL protease cleavage sites in these 
polyproteins (Supplementary Table S2 available in JGV 
Online) agree with previously defined substrate specificity 
for other 3CL proteases (Ziebuhr et al., 2000). All pplab of 


BtCoV 1A, IB and FIKU8 contain functional units that are 
typical for CoVs. For example, all these viruses contain two 
papain-like (PL) protease domains in nsp3, a 3C-like (3CL) 
protease in nsp5, CoV RNA-dependent RNA polymerase 
(RdRp) in nsp 12, RNA helicase (Hel) in nsp 13 and an 
mRNA cap-1 methyltransferase (MT) functional domain in 
nspl6. It was noted that the putative nspl of HKU8 has a 
size of 18 aa, which is significantly smaller than sizes of nspl 
predicted in other group 1 CoVs. 

Similar to other CoVs, the N-terminal regions of these S 
genes are genetically diverse. The protein sequence 
identities between the putative SI domains (the first 740 
residues approximately) of HKU8 to 1A, HKU8 to IB, and 
1A to IB are 38.5, 38.7 and 86.6%, respectively. The 
corresponding sequence identities for the putative S2 
domains are 56.2 (F1KU8 to 1A), 55.9 (HKU8 to IB) and 
91.3% (1A to IB), respectively. Twenty nine, 31 and 30 
potential N-glycosylation sites were predicted in 1A, IB 
and HKU8 S proteins, respectively. Two putative heptad 
repeat (FIR) regions, which are conserved in all CoVs 
(Bosch et al, 2003), were identified in these S proteins 
(both 1A and IB: HR1, aa 973-1122; HR2, aa 1252-1311. 
HKU8: HR1, aa 976-1121; HR2, aa 1261-1317). 
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ORF3 of 1A and IB have the same size of 660 nt and that of 
HKU8 is 669 nt. In terms of amino acid sequence 
similarity, ORF3 is the least conserved gene amongst these 
three viruses (Supplementary Table S3 available in JGV 
Online). Signal anchors in these ORF3 proteins were 
predicted with the probability of 0.545-0.689 by SignalP. 
Multiple transmembrane motifs were also predicted in 
these ORF3 proteins, suggesting these proteins might be 
surface proteins. 

The E genes of these viruses are both the same size of 
225 nt. One transmembrane signal was predicted in each of 
these E proteins (P>0.990), with the N terminus exposed 
to the surface. 

Signal peptides were predicted in these M proteins 
(P> 0.933). Cleavage sites were located between aa 17 and 
18 in these M proteins. The N-terminal signal peptide was 
not observed in other BtCoV M proteins, including BtCoV/ 
512/2005 and HKU2. Three strong transmembrane helices 
and one marginal transmembrane helix were predicted 
with the N terminus exposed to the surface. It has been 
reported that the M protein of the transmissible gastro¬ 
enteritis virus (TGEV) adopts two different topologies in 
the virus envelope with three transmembrane helices and 
the C terminus located inside the virus particle, or with 
four transmembrane helices and the C terminus exposed to 
the virion surface (Escors et al, 2001). This may also be the 
case for M proteins of these group 1 BtCoVs. 

ORF7 is present only in EIKU8, and it is located 
downstream of the N protein. A blast search of the 
ORF7 amino acid sequence revealed no homologous gene 
in GenBank. Prediction of an ORF (or ORFs) downstream 
of the N gene was also reported in the genomes of some 
group 1 CoVs, including TGEV (GenBank accession no. 
NC_002306), feline infectious peritonitis virus (FIPV) 
(GenBank accession no. AY994055), BtCoV/512/2005 
(Tang et al, 2006) and BtCoV HKU2 (GenBank accession 
no. NC_009988). ORF7 in TGEV was shown to produce an 
endoplasmic reticulum associated protein, suggesting it 
plays a role in the association of replication complexes or 
in virion assembly (Tung et al, 1992). The ORF7ab of 
FIPV has been suggested to affect virulence (Elaijema 
et al, 2004). Elowever, there is no significant sequence 
similarity between these ORFs and it is likely that these 
viruses have acquired their unique ORFs from different 
sources. 

The putative core sequence 5'-CUAAAC-3' in the 
transcription regulatory sequence (TRS), which was 
identified in other group 1 CoVs (Dye 8c Siddell, 2005; 
Hiscox et al, 1995; Lau et al, 2007; Pyre et al., 2004), is 
present in the leader TRSs of the studied genomes. They 
start at position 67 in BtCoV 1A and position 68 in both 
BtCoV IB and EIKU8. All the core TRS sequences are 
highly similar to its corresponding leader TRS 
(Supplementary Fig. S2) except the TRSs for all ORF3 
and the E gene of E1KU8. It has been reported that the 
leader TRS can also join the body segment at an unusual 


consensus sequence preceding the ORFs without the 
presence of the core sequence (Hofmann et al, 1993). 
However, we were not able to demonstrate this in these bat 
viruses because none of these viruses could be cultured in 
vitro and we have no means of obtaining sufficient viral 
RNA for detailed investigation. Thus, the authentic TRSs of 
these bat viruses, in particular those for ORF3, are yet to be 
confirmed. 

Phylogenetic analyses of RdRp, helicase, S, E, M and N 
proteins from these BtCoVs were performed. Mouse 
hepatitis virus was included as an out-group in the 
analyses. In all cases, the three viruses studied formed a 
clade that is distinct to group 1 members isolated from 
other animals (Fig. 1). The BtCoVs 1A, IB and HKU8 
viruses phylogenetically cluster with BtCoVs Shandong/ 
977/2006 and Fujian/773/2005 detected in mainland China 
(Fig. 1). Sequence analysis on partial RdRp sequences 
revealed that the Fujian/773/2005 found in Miniopterus 
schreibersii could be a HKU7-like virus (not shown). The 
previously published S gene and helicase gene sequences of 
BtCoV Shandong/977/2006 (Vijaykrishna et al, 2007), 
which was detected in bats of the species Rhinolophus 
ferrumequinum, show a close genetic relationship to the 
three CoVs described in this report. The S protein amino 
acid sequence similarity between BtCoV Shandong/977/ 
2006 and BtCoV 1A is 46.5 %, while BtCoV 1A to HKU8 is 
46.4 % (Supplementary Table S3). A similar relationship was 
also observed for the helicase gene (Shandong/977/2006 to 
1A: 74.2 %; Shandong/977/2006 to HKU8: 75.5 %; 1A to 
HKU8: 76.5 %). Therefore, CoVs related to that found in the 
genus Miniopterus could also be found in other genera of 
bats. Considering the relationship between these CoV S 
proteins, which is one of the major determinants of host 
species specificity of infection (Dveksler et al, 1991), and the 
relatively low detection rates of BtCoVs HKU8 and HKU7 in 
Miniopterus (Chu et al, 2006), it is possible that the two 
viruses are naturally harboured in species of bats other than 
Miniopterus bats. Overall, the above findings highlight that 
some BtCoVs might have the ability for interspecies 
transmission, an event that is relevant to the genesis of 
SARS CoVs in humans. 

It has been previously shown that recombination is not a 
rare event in CoVs (Makino et al, 1986). To detect possible 
recombination between the bat viruses we studied, which 
were all detected in Miniopterus bats found at a single 
habitat, genome sequences of BtCoVs 1A, IB and HKU8 
were aligned and examined by SimPlot and BootScan 
analyses. In addition, these genomes were also aligned with 
other full-length group 1 BtCoV genomes, BtCoV/512/ 
2005 and BtCoV HKU2, for comparison. Bat-SARS-CoV 
was included in the analyses as this virus had been detected 
once in our study (Chu et al, 2006). No evidence of 
recombination between these genomes was detected (data 
not shown). 

The limitation of direct sequencing of PCR amplicons is 
that the presence of co-infections may be masked by the 
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Fig. 1. Phylogenetic analyses of RdRp, Hel, S, E, M and N protein amino acid sequences. S protein of BtCoV HKU2 was not 
included in the analysis as the sequence similarity of this protein to that of all other group 1 CoVs was extremely low (Lau et at., 
2007). MHV was included as an out-group in the analyses. Numbers at nodes indicate bootstrap values as a percentage. Bars 
show the estimated genetic distance of these viruses. GenBank accession numbers of the reference sequences and definition 
of abbreviations are listed in Supplementary Material (available in JGV Online). 
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sequence of the dominant virus. Given the high detection 
rates of BtCoVs 1A and IB in our specimens and by the 
apparent species restriction of these two viruses to two 
different bat species that co-inhabit the same cave, we 
designed species-specific PCR assays for detecting BtCoVs 
1A, IB and HKU8 in our specimens. We applied these 
assays to all specimens that were positive in our CoV 
consensus screening RT-PCR in order to detect the 
presence of co-infections. These viruses were selected for 
screening as they were repeatedly detected in Miniopterus 
bats found within the same cave in Hong Kong. A total of 
38 specimens collected in a period from 2004 to 2006 in 
Hong Kong, which were known to be positive for CoV 
(Chu et al, 2006), were tested by six different species- 
specific RT-PCR assays (RdRp and N genes of BtCoV 1A, 
IB and HKU8). The identities of the amplified products 
were all confirmed by sequencing. In summary, infection 
with 1A alone was detected in 16 specimens collected from 
M. magnater, infection with IB alone was detected in 14 
specimens collected from M. pusillus and infection with 
BtCoV HKU8 alone was detected in one specimen collected 
from M. magnater. There was no evidence of co-infections 
with the more genetically closely related viruses 1A and IB, 
even though these two viruses co-habited the same cave 
and were more frequently detected in our specimens from 
their corresponding hosts. These results suggest that there 
is a strong host restriction for these two viruses that 
apparently hinders interspecies transmission between these 
two species of bat. Interestingly, co-infections of IB and 
HKU8 were detected in seven M. pusillus specimens 
collected in 2004 (n= 2) and 2006 (n= 5) (Table 2). 

The partial N gene sequences (>500 nt) derived from the 
co-infected samples were aligned and a phylogenetic tree 
was constructed (Supplementary Fig. S3 available in JGV 
Online). The analysis revealed that the N gene sequences of 
the IB viruses were not identical (similarities ranging from 
96.8 to 98.8 %), confirming that our findings were not due 
to cross-contamination. On the other hand, three different 
HKU8 N genes sequences were detected (similarities 


ranging from 96.0 to 98.1%) (Supplementary Fig. S3 
available in JGV Online). HKU8 N gene sequences deduced 
from four of five specimens collected in the same sampling 
trip in 2006 were found to be identical. The detection of a 
genetically identical virus in these four specimens could 
represent a recent spread of a single infecting strain within 
this bat population, but the alternative possibility of cross¬ 
contamination at sampling or subsequent procedures 
cannot be conclusively excluded. However, the finding of 
HKU8 and IB co-infections in samples AFCD337 and 
WCF4 with unique sequences in each of these specimens 
(Table 2) and the fact that such co-infections have been 
detected in multiple sampling occasions (2004 and 2006) 
strongly suggest that co-infections of these two viruses in 
M. pusillus occur in nature and are not isolated events. 

In this report, we describe the full genomes of three group 
1 BtCoVs. Co-infection with different CoVs in their host 
provides chances for producing recombinants between 
these CoVs. In our study, there was no evidence of 
recombination events detected between the reported 
genomes of BtCoV 1A, IB and HKU8. But the diversity 
of CoVs found in bats and the detection of BtCoV co- 
infections suggest bats might play a key role in shaping the 
evolution and ecology of CoV. The availability of genetic 
information for these viruses helps us better understand 
the taxonomy and evolution of CoVs in general. But more 
importantly, since SARS and a number of other new 
human diseases have emerged through interspecies trans¬ 
mission of viruses harboured by bats, the biological basis of 
host restriction of bat CoVs remains an important subject 
for study for global public health. 
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Table 2. Specimens co-infected with BtCoV 1 B and HKU8 


The host used was M. pusillus. —, Negative; +, positive; nd, not done. 


Date of collection Sample ID 

(mm/yy) 


RdRp 



N 


1A 

IB 

HKU8 

1A 

IB 

HKU8 

05/04 

WCF4 

- 

+ 

+ 

- 

+ 

+ 

05/04 

WCF14 

- 

+ 

+ 

- 

ND* 

ND* 

03/06 

AFCD307I 

- 

+ 

- 

- 

+ 

+ 

03/06 

AFCD309 

- 

+ 

+ 

- 

+ 

+ 

03/06 

AFCD323 

- 

+ 

+ 

- 

+ 

+ 

03/06 

AFCD325 

- 

+ 

+ 

- 

+ 

+ 

03/06 

AFCD337 

— 

+ 

+ 

— 

+ 

+ 


insufficient RNA samples to perform the assay. 

JThis sample was further tested by RT-PCR assay specific for the nsp3 of HKU8 and was positive in the assay. 
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